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ABSTRACT 

The prospects for accomplishing x-ray polarization measurements of astronomical sources have grown in recent years, 
after a hiatus of more than 37 years. Unfortunately, accompanying this long hiatus has been some confusion over the 
statistical uncertainties associated with x-ray polarization measurements of these sources. We have initiated a program 
to perform the detailed calculations that will offer insights into the uncertainties associated with x-ray polarization 
measurements. Here we describe a mathematical formalism for determining the 1- and 2-parameter errors in the 
magnitude and position angle of x-ray (linear) polarization in the presence of a (polarized or unpolarized) background. 
We further review relevant statistics — including clearly distinguishing between the Minimum Detectable Polarization 
(MDP) and the accuracy of a polarization measurement. 
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1. INTRODUCTION 

Only a few experiments have conducted x-ray polarimetry of cosmic sources. About 40 years ago, several rocket 
observations' measured x-ray polarization from the Crab Nebula. The x-ray polarimeter aboard the Orbiting Solar 
Observatory (OSO-8) confirmed this result with a 19-sigma detection" (P = 19.2% ± 1.0%), thus proving the 
synchrotron origin of the x-ray emission. Data from OSO-8 also provided upper limits to polarization for a few x-ray- 
bright targets. The key challenge for broad-band x-ray polarimetry has always been to make an instrument that is highly 
responsive to polarized radiation but does not suffer from background or other systematic effects. In the past decade, 
development of a class of polarimeters based upon the photoelectric effect 3 ' ’ and placed at the focus of an x-ray 
telescope has produced a renewed interest in x-ray polarimetry of cosmic sources. 

A realistic assessment of understanding prospects for performing meaningful measurements of the linear polarization of 
astronomical x-ray sources critically depends upon understanding uncertainties produced by statistical fluctuations in 
polarization data, especially important in the presence of systematic effects. Unfortunately there has been some 
confusion regarding figures of merit used to compare the relative sensitivity of different instruments, and the impact of 
systematic effects that cannot simply be “calibrated out” without potentially adverse impact on the instrument’s 
capability for obtaining scientifically meaningful measurements. 

We previously addressed 6 statistical uncertainties associated with polarization measurements of astronomical sources. 
Subsequently, we undertook a detailed program to investigate the impact of systematic effects on polarization 
measurements. Here (§2) we here summarize calculations of the statistical uncertainty in measuring linear polarization 
in the presence of a (polarized or unpolarized) background. Next (§3) we illustrate results for a hypothetical 
measurement of the polarization of the Crab Pulsar as a function of pulse phase, for polarimeters that may or may not 
spatially resolve the pulsar from the nebula, which serves as a polarized background. We conclude (§4) with a summary 
of current results and a brief discussion of applying our formalism to other systematic effects that affect the 
polarimeter’ s sensitivity and accuracy. 
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2. STATISTICS OF POLARIZATION MEASUREMENTS 


A polarimeter measures linear polarization by determining the amplitude and phase of modulation with azimuthal angle 
(p by analyzing the data for a 2(p signature. As the calculated polarization w depends upon the directly-measured 
(fractional) modulation amplitude a and an instrument-dependent modulation factor /u (the modulation amplitude for a 
100%-polarized beam), we shall work with the modulation amplitude a throughout this paper. After finding a, we can 
then calculate the fractional polarization using 


m = a / /j . 


( 1 ) 


The probability density function (PDF) specifies the distribution of measured parameters about their true values. For 
example, the bivariate PDF for a measurement of the modulation amplitude a and position angle y/, given their 
expectation values (denoted by a caret), is symbolically 


PaAa,V\a,v)- 


( 2 ) 


In Appendix A, we sketch the calculation of the PDF and derived statistical properties — minimum detectible amplitude 
(MDA) and univariate uncertainties ( da, 8y/) in parameters (a, i//) — for a polarized source in the presence of a known 
(polarized or unpolarized) background. Here we summarize the primary results. Along the way, we offer a few 
comments to clarify the results and their limitations. 

The starting point for deriving the bivariate PDF for the polarization of a source in the presence of a known (polarized 
or unpolarized) background, is the azimuthal density of A/ 2 , the difference in % between two sets of model parameters 
describing the (polarization signature) 2 cp modulation of counts with azimuth <p. 

4 ~Vx 2 {(p)= {Cs [«s cos (2 - y / s ))- ci s cos(2(^ - y / s ))] + C B [a B cos(2 [cp - y / B ))- a B cos(2(^ - y/ B ))]} 2 

d(p (3) 

- ( 2 ^ {C s [l + d s cos(2(^ - y / s ))] + C B [l + d B cos(2(^ - y / B ))])} • 

Here, the polarization parameters are the (fractional) modulation amplitude (a B and a B ) and position angle (y/ s and 
y/ B ) of the total counts from the source (C5) and from the background ( C B ), respectively. The caret over each 
corresponding symbol denotes the “true” expectation value of that parameter. 

First, in order to proceed mathematically and to obtain the standard expression for the bivariate PDF for the source 
polarization parameters, we assume that the statistical variance is azimuthally uniform — i.e., that the denominator of 
Equation (3) does not vary with cp. While this is strictly valid only for the double-null hypothesis that the source and 
background are each unpolarized, it is a reasonable approximation provided that the modulation amplitudes are small 
(a 2 « 1). Fortunately for the mathematics but unfortunately for the measurements, this is usually the case! With this 
assumption, it is straightforward to integrate over azimuth (p to obtain A/ 2 . 

Second, for simplicity, we consider the background to be perfectly known — a B — » d B and y/ B — > y/ B — such that the 
remaining parameters of interest are a$ and y/ s . This reduces the quadrivariate PDF to a bivariate PDF 
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which is normalized over the intervals 0 < a B <00 and 0 < y/s < 71 ■ 



Third, note that the modulation amplitude and the position angle of the background polarization no longer appear in the 
expressions for A/ 1 and for the consequent PDF, in that the background is taken to be perfectly known. However, 
background counts still contribute to the statistical noise in measuring the modulation amplitude. Specifically, the 
standard deviation for determining the modulation amplitude 

^ =4lo as = p{C s + C B )/c s , (5) 

with a as = tJ(C$ + C B )/C$ , the standard deviation for fitting the modulated data. This factor ofV2 , which has caused 
some confusion, essentially results from the ratio of the amplitude of modulation to its RMS (root mean square) value. 

The familiar (Rice distribution) univariate PDF for the modulation amplitude a$ follows Equation (4) by integrating 
over position angle y/ s . 
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where /q( x ) * s the modified Bessel function of the first kind and order 0. 


(6) 


The statistical metrics for polarization all depend upon a c , s (Equation 5), the standard deviation for the uncertainty in 
measuring the modulation amplitude a B . Integration of p as (a B | 0) — Rayleigh distribution — yields the well-known 
formula for the minimum detectible amplitude MDA CL at confidence level CL. 

MDA cl ee V- 2 ln(l - CL ) = J- 2 ln(l - CL ) p(C s +C B )/C S . (7) 

The S-sigma univariate uncertainties in the modulation amplitude and in the position angle, respectively, are 

& s (s)=Sx CTas =Sx a /2(C, + C b )/C,; (8) 

Sx ^. Sx V^v^F a. (9) 

2 a B 2a B 

It is important to recognize that the criterion for an accurate measurement is much more demanding than that for a 
secure detection. For example, if polarization is detected with a modulation amplitude equal to MDA 99 , the 

measurement is only at about the 3-sigma level ( 2 ln(l - CL) — » 3.03 for CL=0.99). Achieving a measurement 

accuracy of 10% (S=10), say, would require an order of magnitude more counts and correspondingly longer observation 
time. If one also seeks to measure accurately the spectral or temporal dependence of the polarization, the demands on 
observing time are even more severe. 



3. EXAMPLE: CRAB PULSAR 

We now apply the results to measurement of the polarization of the Crab Pulsar in the presence of the polarized signal 
from the Crab Nebula, which acts as a polarized background to measurement of the Pulsar. For this calculation, we 
assume the following: 

1 . The polarimeter is an imaging system with negligible instrumental background for the measurement. 

2. Most of the Pulsar emission is contained in a resolution element Q P , with total number of Pulsar counts C P in the 
resolution element and in the energy band of interest. 



3. The number of counts from the Nebula in the resolution element is roughly (Q/. / Q v ) C, v , where C v is the total 
number of counts from the Nebula in the solid angle Cl N (~ 1 arcmin 2 ) and in the energy band of interest. 

4. No additional systematic effects are significant. 


Under these assumptions, the minimum detectible polarization at a confidence level CL is 

A/TT'.D _ MDA CL,.P _ 2 ,]- ln(l - cl) ^Cp +(Cl P /Q. N )C N 
muf cl,/> - — rw rw 7 > 

such that the 99%-confidence (CL = 0.99) minimum detectible polarization is 

_ mda 99 ,p _ 4.292 yjCp +(Qp/Cl N )C N 
mup 99 ,p ~ — 77 77 7 • 

Wp Wp Cp 


( 10 ) 


( 11 ) 


In general, the expected instantaneous count rate C(t) of photons in the energy range Ej-E u 
energy -dependent net effective area A e ^(E) of the telescope system and the photon spectral flux 
source: 


depends upon the 
N E (E,t ) from the 


L -‘U 

C{t)= J dE \A tff {E)[M F XE,t)] p ). (12) 

E, 

In order to perform phase-resolved polarimetry of a pulsar, we must integrate count rates over a pulse-phase range (for 
(fo u , folded on the period t P . The number of pulsar counts in the energy range Ej -E u , over the phase range <for<fom for an 
observation of total duration t obs is then 


t tp^u , \ t tpK E u 

C P = -ft. \dt \C P {t)\= -ft- jdt jdE K ff (£)[^(£)] p }. (13) 

tp tpj, tp tp*, E, 

As the net duration of the observation over the selected pulse-phase range is At = t 0 \, s A(fo = t 0 \, s ((fo u - (fo/) , the 
corresponding number of (temporally constant) nebular counts accumulated over that pulse-phase range is 


C N = fdt \dE K ff (£)[A £ (£)] ;V }= t obs a -<fo,) \dE {^ eff (£)[,¥/,(£)] 

tp tp*, E, E, 


N 


(14) 


Finally, conversion of the (directly measurable) fractional modulation amplitude a P into a fractional polarization m P 
(or an MDA into an MDP) requires knowledge of the polarimeter’s modulation factor ju(e), appropriately averaged 
over photon energy E for the pulsar’s spectrum over the selected pulse-phase range: 


tp<j> u E u 

tpK Eu \dt \dE{v(E)A, ff (E)[N E {E,t)} p } 

(m) p = tj f ) "dt \dE {n(E)A en {E)[N E {E,t)] p }/C P = • (15) 

P tAl E ' '[dt \dE{A en (E)[N E (E,t)} p ) 

tp//, E j 


As an example, we adopt the effective area A e tf(E) and modulation factor /j(E) as functions of photon energy E for an 
imaging x-ray polarimeter explorer (IXPE) proposed in 2007-2008. The response of the polarimeter (Figure 1) governs 
the effective energy range {ErE,fo) of the measurement, which we set to 1. 6-6.0 keV. 





Figure 1: Performance characteristics of the IXPE polarimeter, as functions of the photon energy E. The left panel displays 
its effective area A eff (E); the right panel, its polarization modulation factor /j(E) — i.e., the fractional modulation amplitude 
for a 1 00% linearly polarized beam. 



0 500 1000 1500 2000 

Phase x 1000 


Figure 2: Phase-resolved multi-band data for the Crab Pulsar. The top two plots 8 display the x-ray photon index T x and 
photon spectral flux Ne at E x = 1 keV; the bottom two plots 7 , the visible-light polarization degree and position angle. The 
plots show data folded on the pulsar’s period ( t P = 34 ms) and repeated over two cycles for clarity. (Reproduced from 
reference [8].) 


For the phase-resolved x-ray spectra of the Crab Pulsar (Figure 2), we adopt the values for hydrogen column density 
N h , power-law norm N E (photon spectral flux at E x = 1 keV), and photon index T v from Chandra data s . For the Crab 
Nebula, we use the same hydrogen column density as for the Pulsar, but adopt the x-ray power-law norm and photon 
index from XMM-Newton data 9 . 

Figure 3 illustrates the dependence of MDP 99vP upon total observation time t 0 b S , for three features in the Crab Pulsar’s 
pulse profde (Figure 2): inter pulse (phase range 0.03-0.33); secondary pulse (phase range 0.33-0.43); and primary 
pulse (phase range 0.95-0.03). The plot compares the MDP 99/ > for these three features, using different assumptions 
about the imaging performance of the polarimeter: non-imaging (Nebula unresolved); 15-arcsec imaging; and perfect 
imaging. Clearly, an imaging capability greatly enhances the sensitivity for detecting polarization of point source 
against a background (either nebular or instrumental). 



Figure 3: 99%-confidence Minimum Detectible Polarization (MDP 99 ) for specified pulse-phase intervals of the Crab Pulsar, 
showing importance of imaging, especially away from pulse peaks. In order of decreasing MDP (increasing sensitivity), 
each family of curves shows the x-ray MDP 99P (t obs ) for the inter pulse (0.03-0.33), secondary pulse (0.33-0.43), and 
primary pulse (0.95-0.03). The solid-grey, solid-black, and dashed-black lines denote non-imaging, 15-arcsec-imaging, and 
perfect-imaging polarimeters, respectively. 

Similarly, Figure 4 exhibits the dependence of the uncertainty in a polarization measurement uj p upon total 

observation time t 0 bs , for Ar/>= 0.01 phase bins near three features in the Crab Pulsar’s pulse profile (Figure 2): mid inter 
pulse ($ P =0.15); secondary-pulse peak (^ P =0.40); and primary -pulse peak (^p P =0.00). The plot compares the cr n7p 

for these three features, for different assumptions about the imaging performance of the polarimeter: non-imaging 
(Nebula unresolved); 15-arcsec imaging; and perfect imaging. Clearly, an imaging capability greatly enhances the 
sensitivity for detecting polarization of point source against a background (either nebular or instrumental). For a pulsar, 
rapid changes in polarization amplitude and position angle (Figure 2 bottom two plots) necessitate accumulation of data 
in rather narrow pulse-phase bins, in order to obtain an accurate measurement of m P and y/ P and their pulse-phase 




dependence. Note that for a polarization measurement of 10% accuracy, the polarization amplitude would need to be 
10-sigma. For example, to measure 5% polarization with 10% accuracy requires a measurement uncertainty 
<j m < 0.5% . Even for an x-ray source as bright as the Crab pulsar, accurate phase-resolved polarimetry will be 

challenging. Without imaging, it would be barely feasible. 


1 parameter, 1 (Terrors 



Figure 4: Univariate 1 -sigma uncertainty a mP in measurement of the x-ray polarization amplitude ru P at representative pulse 
phases 0of the Crab Pulsar. As the x-ray polarization amplitude and position angle are likely to change rapidly with pulse 
phase (cf. Figure 2 bottom two panels), counts are accumulated in short pulse-phase bins (/>= 0.01. In order of decreasing 
polarization uncertainty (increasing sensitivity), each family of curves shows the x-ray CT mP near mid inter pulse ($ P = 
0.15), secondary-pulse peak (^ P = 0.40), and primary-pulse peak ($, P = 0.00). The solid-grey, solid-black, and dashed- 
black lines denote non-imaging, 15-arcsec-imaging, and perfect-imaging polarimeters, respectively. 


4. SUMMARY 

We have outlined the derivation of the uncertainties relevant to x-ray polarization measurements. In so doing, we have 
clarified the relationship between figures of merit — the minimum detectible polarization (MDP) and the statistical 
accuracy of a polarization measurement. We emphasize that the integration time required to achieve a certain statistical 
accuracy in a polarization measurement is different from, and may be substantially longer, than the time to achieve a 
particular MDP. We have also examined the impact of certain systematic effects — namely, a (polarized or unpolarized) 
background. 

Even if perfectly known through on-ground and on-orbit calibration, systematic effects may have a profound impact on 
sensitivity, possibly rendering certain measurements impossible. Consequently, we expect to extend our investigation to 
include other and more complicated systematic effects, as well as subtle corrections — such as nonuniform statistical 
variance — to the traditional statistical analysis. 
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APPENDIX A 


A polarimeter measures linear polarization by determining the amplitude and phase of modulation with azimuthal angle 
cp by analyzing the data for a 2cp signature. We write the azimuthal density of counts 3 (counts per unit azimuth) due to 
the source: 


— = ^[ ] + a S cos(2 (<p - Yg ))] , 
dcp 2 n 


(16) 


with C s the total number of source counts, ag=/Ugmg the fractional modulation amplitude for a polarization 
modulation factor pig and fractional (linear) polarization mg of the source, and Ys the corresponding polarization 
position angle. We write the analogous expression for the azimuthal density of counts due to the background, which 
may also be polarized: 


dC ^ = WT t 1 + a B cos(2 {cp - Yb ))] , 


2 n 


(17) 


with C B the total number of background counts a B - pi B m B the fractional modulation amplitude for a polarization 
modulation factor pi B and fractional (linear) polarization m B of the background, and y/ B the corresponding polarization 
position angle. 

Under the hypothesis that true azimuthal density of counts is 


= yM l + «S’ cos(2(<p - y s ))] + -^[l + ci B cos (2 - y / B ) 
dcp 2 k 2 k 

one can, in principle, derive the probability density function (PDF) 

p{ a S’V'S’ a B>V'B I&S’VS’QB’Vb) 


(18) 


(19) 


for the polarization modulation parameters ag,Yg,a B , and Yb , given their “true” expectation values (denoted by the 

caret). The derivation begins by considering A/ 1 , the change in chi-square in varying the parameters about their 
expectation values, which has an azimuthal density 


d U dCjtp ) | . dC(<p) 

dcp'' \ dcp dcp j dcp 


( 20 ) 


Thus, using the expression for the azimuthal density of counts (Eq. 18), we calculate the azimuthal density of A/ 2 : 


In sketching the derivation, we use a compact differential notation. If one bins the data into J azimuthal bins, each spanning an 
azimuth interval A <pj = (2n/J), such that the average counts per bin is <Cj> = ( C/J ), the value of A/ 2 is independent of J. Thus we 
somewhat heuristically take J— > 00. 




( 21 ) 


which simplifies to 

= \ C S [«s cos(2(^ ~Ys))~ “S cos(2(^ - y s ))] + C B [ci B cos(2 {<p ~Yb))~ a B eos{l((p - y> B ))]} 2 

- W ( C S D + a s cos(2(^ - y/ s ))] +C B [ 1 + a B cos(2(y? - y/ B ))])} 

= cos(2(^-(//s)) + C B a B cos,(l((p- Yb))] ( 22 ) 

- [c 5 a 5 COS (2 (^7 -Ys))+ c b“b cos(2 (cp - y/ B ))]} 2 

- W( [c s + C B }+ [c s a s cos(2 (<p ~Ys)) + Cb^b cos(2(tp - y > B ))])} . 

In general, the denominator, which describes the statistical variance (power in counting noise), varies with azimuthal 
angle cp. It is strictly constant with respect to (p only for the double null hypothesis — d B = 0 and a B = 0 — that neither 
the source nor the background is polarized. However, the error in A/ 2 resulting from neglecting the azimuthal variation 

in the statistical variance is of relative order a 2 , which is typically «1. Consequently, we approximate' 1 the above 
equation: 



d A/ 2 (ff>) _ {Cy [a 5 COS (2 {(p-Ys)) -d B cos(2(^> - Ys ))] + C B [a B cos(2 {cp -Yb))~ cos(2(^ - ip B ))]} 2 , 

dcp ~ 2 7r(C s +C B ) K 

Here, we assume that the background is perfectly known — a g — and Yb^Yb — such that the remaining 
parameters of interest are a B and Ys : 


d A/ 2 ((p-a s ,Ys j_«5i£s) 

dtp 
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[ a s cos(2(^ - Ys ))~ as cos ( 2 (<? - Ys ))f 
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a s 


(24) 


with a Qs = + C B /c$ setting the scale for fitting the fractional modulation of the signal in the presence of 

background. Note that, as the background is assumed to be perfectly known, the modulation amplitude and the position 
angle of the background polarization no longer appear in the expressions for A^ 2 and for the consequent PDF. However, 
background counts still contribute to the statistical noise in measuring the modulation amplitude and, hence, will 
diminish the sensitivity for measuring the polarization of the source. 


b 


Elsewhere we shall investigate consequences of the azimuthal variation of the statistical variance. As the mathematical expressions 
are quite complicated, we here restrict the analyses to cases for which the fractional modulation amplitude a is sufficiently small 
that the statistical variance may be regarded as unifonn. 



It is straightforward to integrate the above approximation over azimuth to obtain 

dq- 2 (g s , I 3 , , f + 5 s)~ a s*s cos ( 2(^5 ~ Vs )) _ (ag + ag )- 2 a s a s cos(2(^ 5 - ij > s )) 


(25) 


Here, 


-4la as = p{C s +C B )/C S 


(26) 


sets the scale for estimating the uncertainty in a $ , the fractional modulation amplitude of the signal in the presence of 
background. This potentially confusing factor-of-V2 difference between the uncertainty in determining the modulation 
amplitude o a ^ and the statistical standard deviation o a ^ in fitting the modulation signal basically represents the 

difference between the amplitude and RMS (root mean square) of a sinusoidal function — (cos^ («#)) = 1/2 . 

\ tin 

Using the trigonometric identity cos(2<9) =1-2 sin 2 (<9), we alternatively write 

(27) 

(7 (7 

a s a s 

Given the approximate expression for A/ 2 , we can now write the bivariate probability density function (PDF) p in the 
approximation discussed above, giving the familiar 1 ' 1 result: 
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where Pa s ,i// s ( a S > Vs , y /$ ) * s normalized over the intervals 0 <a s < oo and 0 < y/ s < n . 

Integrating the bivariate PDF over the position angle y/ s , we obtain the well-known univariate PDF for the modulation 
amplitude without regard to the position angle, described by the Rice distribution 1 1 : 

.(a s \a s )« C ‘ S 


Pa c 


-exp 


( 2 , 



+ a S ) 
2 o 2 


a S<*S 

o 2 

L a s J 


L °s J 


a s 


-exp 


(q| +ay) 

1 „ 

a S<*S 

2 [2 (C 5+ C g )/C|]_ 


[2 (C S+ C B )!C 2 ]_ 


(29) 


[2 (C S+ C B )/C 2 S ] 

where Iq(x) * s the modified Bessel function of the first kind and order 0. 



If the source is unpolarized, a$ — > 0 and the PDF for obtaining a modulation amplitude a B is described by the 
Rayleigh distribution: 


Pa 


.(«■> I°) ? 


a S 


exp 


2 

a S 

2 a 2 


da<i 


exp 


' 2 ' 

a S 

2 a 2 

v 


(30) 


Integrating this PDF from 0 to MDA CL and setting the integrand to CL, one solves for the minimum detectible 
(fractional) amplitude MDA CL at confidence level CL: 


MDA cl ee a V" 3 ln(l - CL) = +Cb \ -2 ln(l - CL) . (31) 

C S 

In general, integration of the PDF (Equation 28) allows exploration of the uncertainty in the parameters. If detection is 
significant at a high confidence level, a$ and y/g is each approximately normally distributed about its true value, such 
that the univariate uncertainty in the parameters for a given follows simply from Equation 27: 
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where we approximate sin(£(//) — > Sy/, valid for (Si//) « 1. The “one-sigma” (univariate) uncertainties in the 
modulation parameters are then 
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If the detection is not highly significant, the confidence contours are more complicated 12 . Flowever, when that is the 
case, it is not appropriate to expect an accurate measurement. 
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